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Abstract 

The applications of the general formulae of channel capacity developed 
in the quantum information theory to evaluation of information transmis- 
sion capacity of optical channel are interesting subjects. In this review 
paper, we will point out that the formulation based on only classical- 
quantum channel mapping model may be inadequate when one takes into 
account a power constraint for noisy channel. To define the power con- 
straint well, we should explicitly consider how quantum states are con- 
veyed through a transmission channel. Basing on such consideration, we 
calculate a capacity formula for an attenuated noisy optical channel with 
genreral Gaussian state input; this gives certain progress beyond the ex- 
ample in our former paper Jl3| |. 

1 Introduction 

During about twenty years, the information theory for quantum channel has 
been devised It is called quantum information theory. The most fundamental 
result in it is the formula of channel capacity, which might be an interesting 
subject for optical communication issue; if one wants to know the ultimate 
capability of information transmission, one has to know the channel capacity. 

Here we introduce short survey of a channel model and its capacity theory Q . 
Let {|cv m )}, m = 1, 2 be binary letter states, whose state overlap is k — («i|a2). 
By n-th extension, we choose M codeword states {I'i/'i}; ' " " > IV'm)} {M < 2") 
from the 2™ possible sequences of length n, and use them with respective a 
priori probabilities {tti, ■ ■ ■ ,7r M }, where \tpi) = \a mi («)} ® \a„ l2 (i)) |®a m3 (i))(g) 
• • • <8> |am„(i))- We can regard a codeword state as one signal state. Since the 
codeword states are linearly independent, they span the M dimensional Hilbert 
space. Then the optimization problem of quantum decoding is reduced to the 
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detection problem for Af-ary no coded quantum signals. The above statements 
are the definition of the quantum coding and decoding for Shannon information. 
Here, the decoding means abstract representation of optical receivers, which are 
described by so called positive operator valued measure(POVM). The decoding 
processes can be classified into two classes in general. One of them is decoding 
based on "individual separate measurement". It means that each bit of codeword 
is measured and a received codeword is decoded by the data processing based on 
n measured variables. However, the super additivity is not allowed by decoding 
based on the individual separate measurement |l8). This decoding operator (or 
detection operator) is described as follows: 

n, = ®Liri fe w (i) 
?(*).,.,. 

where is the optimum detection operator for letter states. The other is de- 
coding based on entangled or collective measurement, which is called "entangled 
decoding". In the latter case, the codeword state is regarded as single state and 
a set of M detection operators for M-ary no coded quantum states is applied 
to decide them. Therefore, one can obtain only M values of detection for M 
codewords in the decoding process. In this model, there is a bound for capacity, 
so called Holevo bound. His general formula involves also the mixed state case. 
The detailed discussion is given in the later sections. Then, for pure state case, 
Hausladen, Jozsa, Schumacher, Westmoreland, and Wooters|Q proved that the 
Holevo bound for the information transmission by pure state is really channel 
capacity. That is, the zero error channel capacity C is defined by lim n ^oo C n /n 
and it is given by the maximization of von Neumann entropy with respect to a 
priori probability for an ensemble of signal states. 



Recently, Holevo[ll , and Schumacher- Westmoreland [|19| proved that the 
Holevo bound is also the channel capacity in the general case (mixed state 
case). Finally, Holevo gave the formula for the continuous alphabet 0|. As a 
result, the capability of information transmission increases by design of coding 
and decoding for nth extension. Then a fundamental property, super additivity 
of channel capacity by extended codes, C n + C m < C n + m , was found. 

In this paper, we survey the general scheme of the theory of channel capacity, 
and show how to apply them to optical communication process. Finally, the 
general capacity formula for optical channel with energy loss and back ground 
noise is given, which corresponds to Shannon formula: C = log(l + S/N). 

In Section 2, we present a quantum model of optical communication, "at- 
tenuated noisy channeT . In preparation for calculating the capacity of such a 
channel, we survey the results about the channel coding theorem in Sections 3 
and 4: In Section 3, we consider the " classical- quantum channel modeT , which is 
a generalization of attenuated noisy channel, and give the general formula of its 
capacity. To calculate the channel capacity, we need to solve an optimization 
problem included in the general capacity formula (see the formulation (p6|)). 
In Section 4 we calculate the channel capacity for quantum Gaussian channel, 
solving the optimization problem. Modifying this calculation, we obtain the 
capacity of the attenuated noisy channel in Section 5, where the formulae (p3[) 
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and ( p6[ ) are main results of this paper. In Section 6 we discuss a discretization 
of quantum continuous channel. 

2 Quantum Model of Optical Communication 

We start with presenting a quantum model of optical communication, which 
is the main subject of our interest. In our model, one mode Bosonic states, 
which the transmitter outputs, are conveyed through an attenuation channel 
with classical noise. We can formulate this model as follows. 

(i) transmitted Bosonic states 

Let a be an annihilation operator on a Hilbcrt space Ti.. We take as an input al- 
phabet A the complex plane C or nonnegative integer Z(| , and assume that the 
transmitter sends a Bosonic state p(p) corresponding to a letter p 6 A. In par- 
ticular our concern is concentrated on two examples: (a) p{p) — T>(p)p(0) with a 
squeezed state p(Q) and the unitary displacement operator T>(p) = exp(/ia^— pa), 
(b) p(n) are number states |n.)(n| (n G 

(ii) linear attenuator with classical noise 

The linear attenuator with coefficient k < 1 and Gaussian noise with variance 
N c is described by the transformation 

a' = ka + y/l - k 2 a + £, (2) 

in the Heisenberg picture. In the equation (|^), a$ is an annihilation operator in 
another mode in the Hilbert space TIq of an "environment" and £ is a complex 
random variable with zero mean and variance N c . We assume that the envi- 
ronment is initially in the vacuum state. We denote by T the corresponding 
transformation of states in the Shrodinger picture: Trp(p)a' = TrT[p(p)]d. 

(iii) detection process 

Consider the code of size M, {u\,U2, «m}, consisting of codewords of length 
u, Ui — (p^, ...,Pi n ), where pi j is selected from the continuous input alphabet 
A = C or the discrete one A = Zq • Then the codeword Ui is related to a prod- 
uct state p{p ll ) <g> • ■ • <8 p{p in ) G H ® ■ ■ ■ ® U = H® n , where p{p t ^) = T[p{p i:j )]. 
This correspondence gives a channel mapping stated in Section 0. A detection 
process is given by a detection operator n,; , which is a positive operator- valued 
measure (POVM) on TL® n fo) defined as 

% > 0, Vi, 

M 

The POVM represents a measurement process and decision for a signal to be 
Ui based on the measurement result. Using POVM, the conditional probability 
of the output Uj, given the input was Ui, is obtained as, 

P(u j \u i ) = Trp Ui fl j . (4) 
3 



(3) 



How to define information quantities for such a channel, is shown in the next 
section using a more general formulation. 



3 Coding Theorem of Quantum Channel for Shan- 
non Information 

In this section we survey the coding theorem for classical-quantum channels, 
where Shannon information is conveyed by quantum states. 

3.1 Finite alphabet system 

3.1.1 Holevo- Schumacher- Westmoreland theorem 

Let Ji be a Hilbert space. The classical-quantum channel (coding channel) with 
discrete alphabet A — {1, ..,a} consists of the mapping i — * pi from the input 
alphabet to the set of density operators in TL. The input is described by an a 
priori probability distribution 7r = {7^} on A. A quantum detection process 
(decoding channel) is described by POVM II = {H,} on H. Like Eq. (|) the 
conditional probability of the output j, given the input was i, is obtained as, 

P(j\i) = Trpiflj, (5) 

and the Shannon's mutual information is given by 

t(.,n) = E£'^B't( £l X) )- <6) 

3 1 

Moreover let us consider the product channel in the tensor product Hilbert 
space J{® n = Ti ® ■ ■ ■ ® Ti with the input alphabet A n consisting of words 
u = (ii,...,i n ) of length n, with density operator 

Pu = P n ® • • • ® Pi n ■ (7) 

If n is a probability distribution on A n and II is a POVM on 7i®", we can define 
the information quantity I n (n, II) by a formula similar to (|^). Now let us define 

C n =sup7 n (7r,n). (8) 
7r,n 

Then we have the property of super additivity 

C n + C m < C n +mj (9) 

and hence the following limit exists: 

C= lim C n /n. (10) 
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This limit gives a definition of capacity of the initial channel. Here it should 
be emphasized that in the definition of this quantity we employ an entangled 
measurement. The use of such measurement causes the superadditivity, which 
is characteristic of quantum system. In contrast, in a semi classical case, we 
consider individual separate measurement of the form (Q), and carry out error 
correction based on data processing of measured value. Such a detection strategy 
never produces the super additivity, but only achieves C\. 

Using the von Neumann entropy, which is defined as H{p) = — Trplogp for 
a density operator p, we can obtain a simple formula of the capacity: 



Theorem (ll, 1 



The capacity of arbitrary signal states pi, having finite entropy H(pi), is given 
by 

C = lim Cn/n = maxAiJ(Tr), (11) 

n— >OC 7T 

where 

AH(n) = H (5>^) -5>iH(Pi). (12) 



In general the computation of the quantity C is very difficult, but, fortu- 
nately, if the signal states pi have a certain symmetry, the analytical solution 
can be obtained: Let us consider signal states given as 

Pi = V i ~ 1 p 1 V^- 1 (13) 

where V^V = VV^ = V M = I. Then it is shown |H| that the capacity is 
achieved by a uniform distribution on a priori probabilities. In particular, when 
pi is a pure state represented as p\ = \ip)(ip\, the capacity is calculated as, 

M 

C = -£A>gA„ (14) 
j'=i 

where Xj is 

k=l V / 



3.1.2 C\ and Accessible information 

As described in Sec. |3.1.l| , most essential property of quantum theory for capac- 
ity is the super additivity. For research of coding scheme achieving the capacity, 
we have to clarify properties of the super additivity. As a first step, we consider 
accessible information, which is defined to be 

4c,n(7T) = SUpI n (7T,n), (16) 

n 



5 



for fixed a priori distribution tt. A necessary condition for quantum measure- 
ment ft achieving the accessible information ZaciW was given as follows JlOf: 



(17) 



where 



(18) 



pi- 



Since the above equation provides only a necessary condition for detection op- 
erators, we cannot, in general, solve the problems. From another point of view, 
Davies proved theorems [Q of conditions for optimum detection operators which 
give accessible information. His theorems require that one has to take into ac- 
count a number of detection operator J corresponding to a size of detection 
scheme to get an accessible information. That is, d < J < d , where d is the 
dimension of Hilbert space of letter states. 

Now let us consider C\. Although to derive C\ is very difficult, Davies 's 
theorems play an important role to search it actually. Based on these theories, 
Levitin(l(|, Fuchs and Caves and Fuchs and Peres j(| gave some examples of 
C\ as almost final result, and they clarified an importance of this problem in a 
quantum information theory. On the other hand, BanQ and OsakiJlTj proved 
that if the signal states are group covariant, then the square root measurements 
or minimax solutions in quantum detection theory satisfy Holevo's necessary 
condition for information criterion, and finally Osaki numerically [fl7| and Ban 
analytically derived C\ of the binary pure states taking all parameters into 
account. The final result: C\ is so simple as follows: 



where H(P C (2)) is entropy for error probability, -P c (2) is the minimum error 
probability in the detection problem for binary states, 



where k is the inner product between two pure states. 

Let us turn the problem into the system with linearly dependent state. If 
there are three states in 2 dimensional Hilbert space, then they are linearly 
dependent. When they have equal angle each other, the channel capacity of 
letter states is C\ = 0.6698, where C\ is given by 7Ti = 7r 2 = 1/2, and 7r 3 = 0. 
However, we have no exact solution for other cases. 

3.2 Infinite alphabet system 

Let us turn to the infinite alphabet system; we present a general formula of 
capacity for a quantum continuous channel according to [ p"3| . 

Take as the input alphabet A an arbitrary Borel subset in a finite-dimensional 
Euclidean space E . Then the quantum continuous channel is described by a 



Ci =l-H(P e (2)) 



(19) 



P c (2)= 1-(1- K 2 )^ /2 



(20) 
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weakly continuous mapping p — > from the input alphabet A to the set of 
density operators in Ti., where we assume that the states p(p) have a finite von 
Neumann entropy H{p{p)) = — Trp([x) logp(fx), moreover 

H{ P {p)) < oo. 

Like the discrete case, we treat the product memoryless channel in the 
Hilbert space "ft®" = TC <&■■■(& H (n copies). Then the signal m = (pi l: ...,p in ) 
is related to a density operator p{pti 1 ) ® ■ ■ ■ ® p(pi n )- In a similar way as the 
classical case, we impose the power constraint 

/(Wi) + — + /(WJ<^ (21) 

on the signals Ui = (p^, ...,/ij n ), where / is a continuous positive function on 
£. In particular, we denote by V\ a set of probability distributions 7r on A 
satisfying 

/ f(x)n(dx) < E. (22) 

Without the constraint we cannot obtain anything but meaningless results, that 
is, arbitrary high transmission rates can be achieved with arbitrary low error 
probability with essentially no coding. 

The theory of capacity of continuous channel was established by Holevo. Let 
us sketch the outline of the theory according to Jl^] . We consider discretization 
of the channel by taking a priori distributions with discrete support 

Tr(dp) = ^ KififH ( 23 ) 

i 

where {f^i} is arbitrary countable collection of points and 

Then the capacity C is defined as follows. 

C = sup AH(tt), (25a) 

AH{n) = H ^ p x K(dxU - j H{p x )n(dx), (25b) 

where V'i is a set of a priori distributions with discrete support satisfying input 
constraint (0). Holevo proved J12| that all rates below the channel capacity are 
achievable, and that under some assumptions the channel capacity C is equal 
to 

sup AH(tt). (26) 
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Figure 1: channel model with transmitter constraint 



So far we have described the theory for channel capacity. As a result, the 
channel coding theorem has been proved for quantum channel which corresponds 
to quantum measurement process. This theorem provides interesting results. 
That is, the channel capacity is exactly equal to the Holevo bound which is 
defined without the model of quantum measurement process, while the capacity 
is defined for channel model of the quantum measurement process as described 
in the equations So if one wants to calculate only channel capacity, 

one needs not the physical channel model. In other word, the capacity is given 
only by the ensemble of prepared states or density operators in front of the 
measurement. Thus, sometimes, in mathematical papers the channel is defined 
by a mapping from classical alphabet to quantum states or density operators; it 
is called classical-quantum channel. However, in order to evaluate the capacity 
of the attenuated noisy channel described in Section |^, we should consider a 
slightly different channel model, which is described in the next subsection. 

3.3 Channel model for describing a power constraint 

We present channel model consisting of three components as shown in Figure U 

(1) coding channel: In the coding channel codewords (/ii, p n ) are related to 
quantum states p(pi) <8> • • • ® p(fi<n) respectively. Thus the coding channel is 
described as classical-quantum channel mapping $ : (fix, //„) — > p(fAi) (8> • • -<8> 

(2) transmission channel: Through the transmission channel quantum states 
p(/Lti)(g>- ■ -(E)p(pn) are conveyed, and transformed to quantum states p(p,i)® - ■ -® 
p{p n ) respectively. Thus the transmission channel is described by completely 
positive map T : p(pi)($- ■ -(^p(pn) — > p{pi)®' ■ -®p{Pn)- (3) detection process. 
This model gives a generalization of the attenuated noisy channel. 

When we consider only (discrete) channel without power constraint, we can 
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apply the general capacity formula ( pi] ) directly to the above model by regarding 
$(") = r'"' o<i>( n ) as the classical-quantum channel mapping. On the other hand, 
for a continuous channel, we should consider a constraint on the average power 
of signals p(/xi) <8> • • • ® p(/i„) that the transmitter outputs; such a constraint is 
called transmitter constraint in the following. Then we have no way to describe 
the power constraint function / in ( pil| ) and ( |22]) b y using only i>("). In other 
words we cannot formulate the optimization in (|25|) only by while we can 
represent AH(tt) as a function of n and $^ ^. This is the reason why we should 
explicitly distinguish coding channel from transmission channel F^ 1 ' in our 
model. Although use of the power constraint on the energy of input signals 
(/ii, ji n ) does not produce such a problem, it is not suitable for evaluation of 
the capacity of the attenuated noisy channel. 

In the attenuated noisy channel the transmission map is parametrized 
by k and N c , which are defined in Section ||. As a result, its channel capacity 
is given as a function of k, N c , and $(°)(0) = p(0) in Section ||. 



4 Channel Capacity Formula for quantum Gaus- 
sian States 

In this section we calcuate the channel capacity, based on the channel mapping 
model fi — > p(fi) with input constraint according to |fl3|| . In particular we treat 
the case where p(p) are quantum Gaussaian states. The results will be applied 
to calculation of capacity for the attenuated noisy channel in Section 0. 



4.1 Quantum Gaussian state 



We introduce a Gaussian density operator, which is defined to be a density 
operator with a characteristic function of the form, 



TrpV(z) = exp 



■ t 1 t 

im z z az 

2 



(27) 



where z is a column 2s-dimensional vector [xi, x s ; yi, .., t/ s ]* and 

s 

V(z) = exp [xjqj + y^}. (28) 

In the characteristic function, m is a column 2s-vector and a is a real symmetric 
2s x 2s matrix, satisfying 



m = TrpR, a — -A = TiRpR*, 



where R = [qi, ..,q s ;pi, ...,p s ] and 

A = 

with identity matrix / and zero matrix O 



O hi 

-hi O 



(29) 
(30) 
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4.2 Calculation of capacity 

The mapping from classical parameter to quantum Gaussian state is, in math- 
ematical paper, called quantum Gaussian channel, which forms an important 



class of the continuous classical-quantum channel described in Section 3.2. Holevo 
and coworkers provided the general formula of capacity of this channel. Take 
as input alphabet A the complex plane C, and as the density operator p(0) a 
Gaussian one with mean 0. Then the quantum Gaussian channel is described 
by the mapping p — ► p(fi), where p(p) = D([x)p(0) with the displacement op- 
erator Here we restrict ourselves to the case where we impose the input 
constraint. The input constraint is given by putting /(/i) = huj\p\ 2 in ( pl| ) and 
©■ 

To carry out calculation of capacity for such a channel, the following two 
properties of quantum Gaussian channel are essentially used |l3| : 

(i) the optimum a priori distribution it in ( p5| ) is Gaussian, and the mixture p^ = 
J p(p)i{{dp) is Gaussian again. Let correlation matrices of a priori distribution 
7r and density operators p and p^ be /3, a and 7 respectively. Then the relation 
7 = a + [3 holds between them. 

(ii) von Neumann entropy of Gaussian density operator p with a correlation 
matrix a is calculated as 

H{p) = ispGHA-M 2 ), (31) 

where G(x 2 ) = (x + 1/2) log(a; + 1/2) - (x - 1/2) log(a; - 1/2). In particular, for 
one mode Gaussian state the von Neumann entropy is given by 

( a qq a pp - (a qp ) 2 \ 
H(P) - G ( ■ (32) 

Thus the capacity of the quantum Gaussian channel can be written as 

C = max is P G(-[A- 1 ( a + /3)] 2 ) - is P G(-(A- 1 a) 2 ), (33) 

where B\ is the convex set of real positive matrices /3, satisfying 

Spe/3 < E, (34) 

where 



£1,1 O 
O 1/21 



(35) 



with ei 1 = diag[w 2 /2, w 2 /2], and s x s zero and identity matrices O, I. 

In the formula (^) the maximization with respect to a priori distribution 
j3 is left unsolved. Holevo showed [H| the explicit calculation of ([33]) for one 
mode quantum Gaussian channel with input constraint. The results are 
[A] If the inequality 



+ (a qp ) 2 < (-) 2 (36) 
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holds, then we have 



C=G 



h 2 u 



i 



E + -(or a 



2 n 11 



G 



a qq a pp - {a qp ) 2 
W 



(37) 



where G(d 2 ) = (d + l/2)log(d + 1/2) - (d - l/2)log(d - 1/2). By putting 
N s = E/fiw and N = Trp(0)tfa = (uj 2 a qq + aP p )/2hu - 1/2, the first term in 
( p7| ) can be written as 



(N + N s + 1) Iog(JV + N s + 1) - (N + N s ) log(JV + JV.). 



(38) 



When the inequality ( pq ) holds, the mixture Pn = f p([i)n(dfi) with the opti- 
mum a priori distribution tt is always equal to the Bose-Einstein distribution 
with mean number of quanta N + N s . Moreover, assuming that p(0) is a co- 
herent state, we can also simplify the second term in Eq. ( |37| ) and obtain the 
capacity as 

C =(N + N s + 1) Iog(JV + N S + 1)-(N + N s ) \og(N + N s ) 
— (N + 1) k>g(N + 1) + N log N 

N„ \ , . / 1 



-log 1 



N + 1 



+ (iv + Agio g i + 



N + N s 



(39) 



[B] If the inequality ( |36| ) does not hold, we have 



C=G 



h 2 u 2 



E+-{u 2 a qq + a pp ) 



G 



^/(u> 2 a qq - a pp ) 2 /4 + io 2 {a qp ) 2 - E 

a qq a pp - (a qp ) 2 
K 2 



(40) 



Note that the capacity under output constraint can be obtained by some 
modifications of the above discussion (see E| ) . 



5 Application to attenuated noisy channel 

The attenuated noisy channel corresponds to optical fiber channel or space chan- 
nel in optical communication system. In this section, we show the capacity for- 
mulae when the communication process has such an attenuated noisy channel. 
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5.1 Gaussian state 



In this subsection we calculate the capacity for the attenuated noisy channel 
formulated in Sec |^. As a first step, we make some preparations for applying 
the results in Section 4.2 to this calculation, 
(i) construction of the mapping fj, — > p(p,) 

Since squeezed state is the most general Gaussian state, we treat squeezed state 
as an example of Gaussian state. The squeezed state p(0) = S(-f) |0) is described 
as a pure Gaussian state with correlation matrix 



a 



yi<i 
gp 



a 



'ii> 



with elements 



2uj 



[cosh 27 — sinh 27 cos 9} 
[cosh 27 + sinh 27 cos 6] 



a qp = —sinh 27 sin 9. 

2 ' 

Then the output state p(p) = T[p(p)] has the characteristic function p 



Trp(p)V(z) = Trp(p)V(kz) ■ exp 



--A(fc,7V c )z*z 



where 



X(k,N c ) = 



1-k 2 



■N, 



(41) 

(42) 
(43) 
(44) 

(45) 

(46) 



This indicates that p(p) — T>(kp)p' Q 'D(kp)\ where p' is the Gaussian state with 
the correlation matrix a = k 2 a + hX(k, N C )I and the mean 0. 
(ii) transmitter co nstr aint 

As shown in Section 3.2, instead of input or output constraint given in Sec- 
tion 4.2, we need to introduce a somewhat different constraint for reasonable 
evaluation of the effect of squeezing. That is, we impose a transmitter constraint 

Spe(5 + 0) < hw (N tr + , (47) 

where fj is a correlation matrix of a priori probability distribution. 
Thus the capacity with the attenuation channel can be written as 



1. 



C= max-S P G(-[A- 1 (a + /3)] 2 ) 
f3eB[ 2 



1 



S P G(-(A- 1 



where B[ is the convex set of real positive matrices /3, satisfying 
Spe/3 < k 2 









— Spea 



(48) 



(49) 
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We can obtain the explicit formula of the capacity by replacing the energy 
~~ ?|) and Q with k 2 [kuj (N tr + 1/2) - Spea]. In the following 

1, a q P = 0, a« > a pp and 



bound E in @ 
we calculate the value of the capacity when u> 



a qq a pp = h /4 hold. By these substitutions, the inequality (p6[) becomes 



and we have 



max{<i m , 5 W } < h(N tr + 1/2), 
£ + Spea =hk 2 (N tr + 1/2) + h\(k,N c ) 



Nr. 



(50) 



(51) 



and 



/iA(fc, N c )){k 2 a pp + h\{k, N c )) 



N, 



k 2 \(k,N c )(2N sq + 

2 

+ k 2 N sq [(l 



< + X(k 1 N c ) 2 
k 2 ) + 2N, 



(52) 



where N sq is mean number of quanta for a transmitted squeezed state p(0), 

From this the capacity is calculated as follows. 
[A] If the inequality (|5(]) holds, we obtain 

G ={k 2 N tr + N c + l) \og{k 2 N tr + N c + 1) 
- (k 2 N tr + N c ) log(k 2 N tr + N c ) 



G 



Nr. 



n 2 



(53) 



+ k 2 N sq [(l-k 2 ) + 2N c 



When we transmit coherent states, that is N sq = 0, the second term in (|5^) is 
simplified and the capacity is given by 



C ={k 2 N tr + N c + 1) log(A: 2 iV tI . + iV c + 1) 

- (k 2 N tr + N c ) \og(k 2 N tr + N c ) 

- (N c + 1) log(7V c + 1) + AT c logiV c . 



(54) 



As G in (53) is a monotonously increasing function of d , we can find that the 
capacity given by ( |53] ) is maximized when there is no squeezing N sq = or 
when there is no noise and attenuation (k — 1 and N c — 0). That is, squeezing 
necessarily decreases the capacity of attenuated noisy channel. 
[B] When the inequality ( |50| ) does not hold, using 



|/2 -E 

~-k 2 \a qq - a pp \/2 - hk 2 (N tr + 1/2) + fc 2 Spa/2 
--k 2 m&x{& qq , & pp } ~ hk 2 (N tr + 1/2), 



(55) 
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Figure 2: capacity for the attenuated noisy channel 



we obtain 



C =G [ k 2 [2N tr + 1} 



X(k,N c ) + k 



2 max{5 99 ,a pp } 



-x(k,N c r- k ^« qq ^ 2 



G 



n 2 



+ k 2 N sq [{l-k 2 ) + 2N c ] 



(56) 



We would like to call these equations (|53|) and (|56|) "quantum Shannon formula" 
based on Holevo Theorem, because these correspond to classical Shannon for- 
mula C = log(l + S/N). In Figure || we present graphs of the capacity with 
respect to squeezing parameter 7, when (k,N c ) = (1,0), {k,N c ) = (0.9,0) and 
(k, N c ) = (0.9, 0.1). These graphs show that the capacity for the non-attenuated 
noiseless channel does not change if squeezing is not too large, while that for 
attenuated noisy channel is necessarily decreased by any squeezing. 

5.2 Number state 

If we employ photon number state as the transmitter state, then alphabets are 
discrete number : n. The density operator is described by 



p = £P(n)|n)<n| 



(57) 
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In the case of no attenuation process, the maximum entropy is given when 

P(n) = l - f-^-V (58) 

K ' 1+ < n > \1+ <n>J K ' 

which is called Bosc-Einstein distribution. < n > is average photon number. 
So the capacity becomes 

C = log(l+ < n >)+ <n> log(l + — - — ) (59) 

<n> 

For such a photon signal, the channel models of attenuation and amplification 
processes were discussed by Shimoda, Takahashi, and Townes|2(|. The channel 
model of the noiseless attenuation process is described by binomial distribution 
as follows: 

77 l 

P{n y \n x ) = x '_ v k n »(l - k) n *- n y (60) 

TLy.\Tl X Tly). 

where n x and n y are input and output photon numbers, respectively. This 
means that if the input state is certain number state |ra x )(n x |, then the output 
is described by 

P^=J2^T^ V ik n «(l-kr*- n «\n y ){n y \ (61) 

^ n y \(n x - n y y. 

If the probability distribution in the input is P(n x ), then the output density 
operator becomes 

Pout = >^ y, P(n x )P(n y \n x )\n y ) (n v \ (62) 



So the channel capacity is 

C = max {H{p out ) - V P(n x )H(p n J} (63) 

-P("x) 

We have no solution in this case. However, if we assume that the input distribu- 
tion is Bose-Einstein distribution, the output distribution is also Bose-Einstein. 
Then, the Holevo-Yuen-Ozawa bound p2]| bound becomes for < n y 1 



AH = -(log < n y > - log(27r) + 1 + 7) (64) 



for < n y ><Si 1 

AH = (1 - 7) < n y > (65) 
where 7 is Euler constant(0.5772...). 
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6 Information theoretical meaning of ultimate 
channel capacity— Binary discretization 



In this section we turn our attention to the idea of discretization |21j| , which 
is introduced by the equation ( ^3|) to treat the continuous channel analytically. 
The discretization means deriving discrete channels from the original continuous 
channel by restricting the number of letters used in the information transmission 
to a finite one. The properties of the original continuous channel can be deter- 
mined by the behavior of all such derived discrete channels. The main purpose 
of this section is to show that the discretization for the quantum continuous 
channel has properties inherent in the quantum system. To this purpose, we 
recall the Gordon's suggestion [0 that the binary quantum counter can extract 
essentially all the information incorporated in a weak light wave. Basing on 
this suggestion, we infer that the binary discretization, restricting the number 
of letters to only two, realizes asymptotically the capacity in the quantum case, 
while the binary discretization necessarily causes some loss of information in the 
classical case. In the following we shall verify this inference by investigating the 
binary discretization of the noiseless coherent state channel. We shall calculate 
the capacity given by the optimum binary discretization, and compare it with 
the capacity of the original continuous channel. Further we shall also consider 
the binary discretization for the maximum mutual information; this binary dis- 
cretization is closely related with the binary model presented in the Gordon's 
suggestion. 

Now let us demonstrate the binary discretization of the coherent state chan- 
nel, input signals are coherent states {\a}} (a £ C) and the corresponding a 
priori distribution n is constrained by 

/ \a\ 2 Tr(d 2 a) < m. (66) 
Jc 

The coherent state channel is a basic example of the quantum Gaussian channel, 
and its capacity is given by putting N s = m and N = in (^) as, 

Cbb = {m + 1) log(m + 1) — mlogrn. (67) 

Let us find the optimum binary discretization for the capacity of the coherent 
state channel, solving the optimization problem: 

C (2) = sup su P H({a,P},{Q,l-Q}). (68) 

{a,/3} Q 

Here the suprema are taken over all binary set of inputs, {a, /?}, and all proba- 
bility assignments Q, 1 — Q satisfying the constraint 

Q\a\ 2 + (1 - Q)\f3\ 2 <m, (69) 

and H({a, [3}, {Q,l — Q}) is the capacity of binary channel with input signals 
{\a), and the corresponding a priori probabilities {Q, 1 — Q}. 
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Figure 3: C^ /Cbe and C\ /Cbe with respect to mean number of quanta 



We can calculate the quantity C^ as 



C (2) 



-2m 



-2m 



-2m 



log(- 



log(- 



-2m 



-) 



(70) 



Here the optimum binary discretization is given by the symmetric binary signals 
{\a), | — a}}. In Figure || the value of C^ /Cbe is plotted with respect to the 
energy constraint m. The figure shows that the binary discretization realizes 
approximately the capacity Cbe in a weak photon case (m < 10~ 2 ). Indeed, by 
applying e~ x « 1 — x and log(l — x) ~ —x to Eqs. ( |67| ) and ( |70|) and neglecting 
the term of m 2 , the following approximation holds, 



Cbe « C^ 1 



-mlogm - 



for m«l. 



(71) 



These results show that the coherent state channel can be simulated by the 
binary discrete channel in a weak photon case. 

On the other hand, the classical continuous channel can not be simulated 
by any discrete channel[?]. In particular the binary discretization does not 
realize the capacity. To achieve the capacity, we must solve a more complicated 
optimization problem. The problem, related with the sphere packing, is still one 
of main topics in the classical information theory. 

We shall consider another optimization, which is concerned with the maxi- 
mum mutual information and formulated as follows, 



C[ = sup sup 

{",/3} Q 



sup I d ({a,f3},{Q,l 
{n Q ,n>} 



Q},{ii a ,iip}). 



(72) 



Here Id is the mutual information of the binary channel with input signals 
{\a), |/3}}, the corresponding a priori probabilities {Q, 1 — Q} and the sig- 
nal detection process given by the positive operator-valued measure (POVM) 
{fl a , II/3}. The optimum POVM for two input letters consists of two detection 
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operators B , and hence we can restrict the number of the detection operators 
in Eq. (72) to two without loss of generality. 

Let us solve the optimization problem. The right-hand side of Eq. (|7^) can 
be divided into two parts as follows, 

C[ 2) = sup sup/ a ({a,/3},Q) (73) 

{",/?} Q 

I a ({a,f3},Q)= sup / d ({a,/n,{Q,l-Q},{ni,n 2 }). (74) 

The accessible information is known (lj to be calculated as 

I a = I a ({a,0},Q) = H(Q)-H(f({a,f3},Q)). (75) 
In this equation 

ff(Q) = -QlogQ-(l-Q)log(l-Q), (76) 
/ = f({a, /?}, Q) = l(i-y/i- 4«2Q(i _ Qfj , (77) 



where 



k 2 = \(a\(3)\ 2 = exp[-|a-/?| 2 ]. (78) 



On the contrary to C^ 2 \ the quantity c[ 2 ^ can be only obtained approxi- 
mately: the optimum a priori distribution is given by 

-mlogm - m 
Qopt{m) w , (79) 

and the optimum binary signals are {\a}, |/3)} satisfying /3 = ka (—1 < fc < 0) 
and 

|a| = Q opt M m ' (8 ° a) 

I/3I 2 = T%^L^- ( 80b ) 
1 - Qopt{m) 

Then The value of is given by I a {Qo P t(m)). 

In Fig H the value of c[ 2 ^ /Cbe is plotted with respect to the energy con- 

( 2) 

straint m. The ratio C{ /Cbe converges to 1 when m —* 0. This substantiates 
the validity of the Gordon's suggestion in theory. However the convergence 
speed is very slow. For example cj 2 ^ /Cbe is 0.82 for m = 1CT 10 , and then the 
capacity takes a very small value: Cbe ~ 2.4 x 10~ 9 . This shows the Gordon's 
suggestion is not correct in practice. We can further obtain the following ap- 

(2) 

proximation of C{ , the value of which is larger than the capacity log(m + 1) 
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but is less than the capacities C^ 2 \ Cbe- 

log(m + 1) < C\ ~ —to log to — to 

< C (2) «C B £ ss -mlogm + m (81) 
for mCl. 

Strictly speaking, should be compared to the maximum mutual information 
C\ of the original quantum continuous channel. It is conjectured C\ gives a 
good approximation of C\ , but we have no way of calculating C\ for the quantum 
continuous channel. 

7 Conclusion 

In this review paper, applications of the general formulae for capacity in the 
quantum information theory to optical communication processes have been in- 
troduced. As a special result, we would like to emphasize that capacity formula 
was calculated for attenuated noisy channel by using Holevo's general theory of 
capacity for continuous alphabet. This formula may provide the capacity for- 
mula in optical field which corresponds to Shannon capacity formula for Gaus- 
sian noise model in long distance microwave communication system. 
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